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Abstract — In this paper, pilot-symbol-assisted transmission in 
cognitive radio systems over time selective flat fading channels 
is studied. It is assumed that causal and noncausal Wiener filter 
estimators are used at the secondary receiver with the aid of 
training symbols to obtain the channel side information (CSI) 
under an interference power constraint. Cognitive radio model 
is described together with detection and false alarm probabilities 
determined by using a Neyman-Person detector for channel 
sensing. Subsequently, for both filters, the variances of estimate 
errors are calculated from the Doppler power spectrum of the 
channel, and achievable rate expressions are provided considering 
the scenarios which are results of channel sensing. Numerical 
results are obtained in Gauss-Markov modeled channels, and 
achievable rates obtained by using causal and noncausal filters 
are compared and it is shown that the difference is decreasing 
with increasing signal-to-noise ratio (SNR). Moreover, the optimal 
probability of detection and false alarm values are shown, and 
the tradeoff between these two parameters is discussed. Finally, 
optimal power distributions are provided. 

I. Introduction 

The need for the efficient use of the scarce spectrum in 
wireless applications has led to significant interest in the 
analysis of cognitive radio systems which are proposed as 
systems that have the potential to operate in the licensed 
spectrum without disturbing the primary users and hence better 
utilize the spectrum [1]. Since it is very important for the 
cognitive users not to interfere with the primary users during 
their transmission, detection of primary users' activity in the 
channel is the main concern in cognitive radio research. In [2] 
and [3] the authors developed an optimal strategy for oppor- 
tunistic spectrum access by maximally utilizing spectrum op- 
portunities in cognitive radio networks with multiple potential 
channels. Note that spectrum sensing brings along an essential 
decrease in the throughput of a system. Realizing this fact, the 
authors in [4] studied the tradeoff between channel sensing and 
throughput considering Shannon capacity as the throughput 
metric. Providing primary users with a sufficient protection 
against interference, they formulated an optimization problem 
and found out the optimal sensing time. Motivated by jointly 
detecting signal energy levels over multiple frequency bands 
rather than considering one band at a time, Quan et al. in [5] 
introduced a novel wideband spectrum sensing technique. 

An important characteristics of wireless communications is 
that channel conditions vary over time randomly due to mobil- 
ity and changing environment If the channel conditions are not 
known a priori, training sequences are frequently employed to 
perform channel estimation. Early studies conducted by Cavers 



[6] and [7] provided an analytical approach in pilot-assisted 
transmissions. In [8], Hassibi and Hochwald optimized the 
power and duration of training signals by maximizing a ca- 
pacity lower bound in multiple-antenna Rayleigh block fading 
channels. Ohno and Giannakis [9] focused on a capacity lower 
bound and optimized the spacing of training symbols and 
training power by employing a noncausal Wiener filter for 
channel estimation at the receiver in slowly-varying channels. 
In their studies [10], Abou-Faycal et al. employed a causal 
Kalman filter at the receiver in which all the past pilot symbols 
are used to estimate the channel coefficients. More recently, 
we in [11] jointly optimized the pilot symbol period and power 
allocation among pilot and data symbols by maximizing the 
achievable rates in Gauss-Markov fading channels, and in [12] 
we used causal and noncausal Wiener filters at the receiver to 
estimate the channel coefficients. 

In this paper, we study the training-based transmission 
schemes for cognitive radio systems in time-selective Rayleigh 
fading channels where cognitive radios are able to transmit 
data as long as they do not disturb the primary (licensed) 
users of the channel. We employ causal and noncausal Wiener 
filters at the receiver to obtain the estimates of the channel 
fading coefficients. We maximize a capacity lower bound by 
optimizing the training parameters. Note that even though the 
treatment is in general, we provide our numerical results for 
Gauss-Markov channel models. 

II. Cognitive Channel Model and Channel 
Sensing 

In this paper, we consider a cognitive radio channel model 
communicating over a time-selective Rayleigh flat fading 
channel in which a secondary transmitter attempts to send 
information to a secondary receiver, probably interfering with 
the primary users. Initially, the secondary users perform chan- 
nel sensing, and then depending on the primary users' activity, 
the secondary transmitter selects its transmission power, i.e., 
when the channel is busy, the average symbol power is Pi, 
and when the channel is idle, the average symbol power 
is Pi- For example, if P\ = 0, the secondary transmitter 
stops transmission when the primary users are sensed to 
be active. During the transmission, the discrete-time channel 
input-output relation in the symbol duration is given by 

yk = h k x ltk +n k + s Ptk fc = l,2, ... (1) 



if the primary users are in the channel. On the other hand, if 
the primary users are idle in the channel, we have 



yk = h k x 2 .k+n k k = 1,2, 



(2) 



In the above equations, and x 2 .k are the complex-valued 
channel inputs, and yk is the complex-valued channel output, 
and {nk} is assumed to be a sequence of independent and 
identically distributed (i.i.d.) zero mean Gaussian random 
variables with variance a\. In (1) and (2), hk denotes the 
fading coefficient between the secondary transmitter and the 
secondary receiver, which is assumed to be a zero-mean Gaus- 
sian random process with power spectral density Sh{e^ w ). It 
is further assumed that x k is independent of hk and rife. While 
both the secondary transmitter and the secondary receiver 
know the channel statistics, neither has prior knowledge of 
instantaneous realizations of the fading coefficients. In (1), 
Sp.k represents the sum of the active primary users' faded 
signals arriving at the secondary receiver. We assume that the 
bandwidth available in the system is B, and the channel is 
sensed every first N seconds of the block which is T seconds. 
Note that the discrete-time model is obtained by sampling the 
received signal every T s = -g seconds. 

If the transmission strategies of the primary users are not 
known, energy-based detection methods are well-suited for the 
detection of the activities of primary users. Following the same 
approach as in [13], we consider a hypothesis testing problem 
between the noise nk and the signal s p ^ in noise. Noting that 
there are NB complex symbols in a duration of N seconds, 
this can mathematically be expressed as follows: 



"Ho : Vk=n k , k = l,...,NB 
Hi : yk=s p< k+n k , k = l,.. 



(3) 



, NB. 



Considering the above detection problem, the optimal 
Neyman-Pearson detector is given by [14] as 



NB — l 



i 

NB 



(4) 



k=0 



where A is the detection threshold. We assume that s p ^ has a 
circularly symmetric complex Gaussian distribution with zero- 
mean and variance cr^. Assuming further that {s Py k} are i.i.d., 
we can immediately conclude that the test statistic Y is chi- 
square distributed with 2NB degrees of freedom. In this case, 
the probabilities of false alarm and detection can be established 
as follows: 



P f = Pr(Y > X\H ) = 1 - P ( - L ^A..\ 



P d = Pr(Y >\\H 1 ) = 1-P 



( NBA 



, NB 



(5) 



(6) 



where P(x, a) denotes the regularized lower gamma function 
and is defined as P(x, a) = where j(x, a) is the lower 

incomplete gamma function and T(a) is the Gamma function. 

In the above hypothesis testing problem, another approach 
is to consider Y as Gaussian distributed, which is accurate 
if NB is large [4]. In this case, the detection and false 
alarm probabilities can be expressed in terms of Gaussian Q- 



functions. We would like to note the rest of the analysis in the 
paper does not depend on the specific expressions of the false 
alarm and detection probabilities. However, numerical results 
are obtained using (5) and (6). 

III. Pilot Symbol Assisted Modulation and 
Channel Estimation at the Receiver 

We consider pilot-assisted transmission where periodically 
inserted pilot symbols, known by both the sender and the 
receiver, are used to estimate the fading coefficients of the 
channel. We assume a simple scenario where a pilot symbol 
is transmitted every T seconds after channel sensing, and 
(T — N)B — 1 data symbols following the pilot symbol 
are transmitted between the secondary transmitter and the 
secondary receiver. We consider the following average power 
constraints: 

(/+1)TS-1 

^ E l*i,*| 2 < ? i * = ■•■• -2,-1,0,1,2,... (7) 

k=lTB+NB 

when the channel is detected as busy. On the other hand, when 
the channel is detected as idle the power constraint becomes 



(Z+1)TS-1 

y E \^,k\ 2 <P 2 l = ... ,-2,-1,0,1,2,... (8) 

k=lTB+NB 

Therefore, the total number of symbols transmitted in one 
block is (T - N)B, and the total energy is limited by TPx 
and TP 2 for the cases when the channel is busy and idle, 
respectively. 

Communication takes place in two phases. In the training 
phase, the transmitter sends pilot symbols and the receiver 
estimates the channel coefficients. In this phase, the channel 
output is given by 

VlTB+NB = hiTB+NB\f~Pt + n lTB+N B + Sp,lTB+NB (9) 

when the channel is busy, and 

VlTB+NB = hiTB+NB\/~Pt + niTB+NB (10) 

when the channel is idle. P t is the power allocated to the 
pilot symbol. In the data transmission phase, data symbols 
are transmitted. In this phase, input-output relationship can be 
written as 

Vk = hkxx.k+hkxx.k+rik+sp.k ITB+NB <k< (l+l)TB-l 



(11) 



when the channel is busy, and 



Vk = h k x 2 .k + h k x 2 .k + nk ITB + NB <k< (l + l)TB-l 

when the channel is idle where hk and hk are the estimated 
channel coefficient and the error in the estimate at sample time 
k, respectively. Note that h k and h k for 1TB + NB < k < 
(l+l)TB — l are uncorrected zero-mean circularly symmetric 
complex Gaussian random variables with variances a% and 

2-i k 

<r~ , respectively. 

Let us assume that C is the set of integers, I, such that 
outputs at time samples 1TB + NB are used to estimate the 
channel fading coefficient at time k. Due to periodicity, we will 



2 



consider only the cases where k = NB, NB + 1, . . . , TB — 1. 
We consider a causal and a non-causal estimation procedure. 

IV. Achievable Rates 

Considering the decision of channel sensing and its correct- 
ness, we have four possible scenarios: 

1) Channel is busy, detected as busy, 

2) Channel is busy, detected as idle, 

3) Channel is idle, detected as busy, 

4) Channel is idle, detected as idle. 

We assume that a Wiener filter, known as the optimum linear 
estimator in the mean-square sense, is used at the secondary 
receiver to estimate the fading coefficients of the channel 
between the secondary transmitter and the secondary receiver. 
Since the pilot symbol is sent every T seconds, the channel 
is sampled every T seconds. Therefore, we have to consider 
the under-sampled version of the channel's Doppler spectrum 
given by 



TB-1 

£< 

i=0 



■>S h (e-^™— 



Also as shown in [12], [15] and [9], it can be easily obtained 
that the channel MMSE for the noncausal Wiener filter at time 
1TB + NB + m is given by 

2 P t r \S h ,m(e jw )\ 2 
n > 27 ' ' 



4 

itTBL + N B + n 



v P t S hfi {e^)+al + pal 



-dw. 



(14) 

where p is the activity probability of primary users in the 
channel. As for the causal Wiener filter, using a similar 
approach as in in [12] and [15], MMSE at time ITB+NB+m 
is given by 

l {en\ 2 



ftlTB + N B + r, 



P 
~2tt 

1 

'2tt 



" \S^ 

7t PtS h , (e^ 



■dw 



P 



S h . m {e' w ) 
L*{e' w ) 



2 



dw. 



(15) 



where L*(e jw ) is obtained from the canonical factorization of 
the channel output's sampled power spectral density at m = 0, 
given by 



PtS h , (en + ol + pal = r e L(e Jw )L*(en- 



(16) 



The operators {} + and {}_ give the causal and anti-causal 
parts of the function to which they are applied, respectively. 
Using the orthogonality principle we have 



a 



(17) 



hlTB + NB + m '" tl lTB + TB + n 

where <r~ is the variance of the channel estimate at 

hlTB + NB + m 

time 1TB + NB + m. Similarly as in [12], we regard the 
errors in (11) and (12) as another source of additive noise and 
assume that 

wi,k = hkXi t k + rik + s Pi k (18) 



and 



w 2 ,k = h k x 2 ,k + n k 



(19) 



are zero-mean Gaussian noises with variances 
when the channel is busy, and 

2 2 



~2 2 o I 2 



(20) 



(21) 



when the channel is idle, respectively. Considering the errors 
in (11) and (12) and the scenarios given above, we obtain the 
following lower bound on the channel capacity: 



C> 



^ (T~N)B 



m—1 



E^pP d log 



+ p(l-P d )log 1 



). (13) +(l-p)P/log 1 




+ (l-p)(l-P / )log 1 + 



where £ is a zero-mean, unit variance, circularly sym- 
metric complex Gaussian random variable, and Pi -m = 
E^XLiTB+NB+m] 2 } and P 2 , m = E [\x 2 .iTB+NB+m\ 2 ] de- 
note the power of m data symbol after the pilot symbol when 
the channel is busy and idle, respectively. Note that the error 
variance a~ depends on m or the location of data 

hlTB+NB+m 

symbol with respect to the pilot symbol. However, if the fading 
is slowly varying and the channel sampled fast enough, we 
can satisfy fjj < -j^ or wd < -§Ff where fu is the maximum 
Doppler frequency and Bwd — 27t/d. In this case, we can 
see from Nyquist's Theorem that there is no aliasing in the 
under-sampled version of the channel's Doppler spectrum, and 
hence \S h , m (e^)\ = |S M (e*»)| = S h (e^' T / B )/T '/ B, for 
m E [1,TB] and — n <w<ir. Henceforth, (14) becomes 

Pt f v \Sh,m(e jw )\ 2 



hlTB + N B + n 



a 2 Pt 



2k .L n PtS h ,o(e^) + al+pa 2 Si 
\S h (e^)\ 2 



P t S h (e^)+TB(a 2 +pal) 



dw 



■dw 



4 



(23) 



and (15) becomes 

S h , m (e jw 



hlTBJ, 



\Sh, m (e- 



,jw\\2 



P t S hfi (e^)+a*+pal 



L*{ei w ) 

P 
2tt 



dw 



+ 



2nr e (TB) 2 



P t S h (e?*>) 
S h (e$ 



L*{ei w ) 




(24) 
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where 



TB 



r e L((? w )L*(e? w ). 



(25) 



Therefore, under this assumption, the error variances become 
independent of m. Since the estimate quality is the same for 
each data symbol regardless of its position with respect to 
the pilot symbol, uniform power allocation among the data 
symbols is optimal, and we have 



Px.m — 



and 



P 



2,m 



TP! - P t 

(T - N)B - 1 

TP 2 - Pt 
(T - N)B - 1 



= Pi 



Pi 



Then, (22) becomes 



,9(1 -P d ) log [l 

(1- p)P f log 1 1 



P X *l 



(1- P )(l-P f )\0g 1 




(26) 



V. Numerical Results in Gauss-Markov Channels 



In this section, we consider Gauss-Markov fading process 
whose dynamics is given as 



hk=ahk-x+Zk 0<a<l k = 1,2,3,.. 



(27) 



where a is a parameter that controls the rate of channel vari- 
ation, and {zk} are i.i.d. circular complex Gaussian variables 
with zero mean and variance (1 — ol)o\. The power spectral 
density of this channel is given as 



S h (e> w ) 



(1 



cr erf 



1 + a 2 — 2a cos(w) 



— 7T < w < 71". 



(28) 



Note that S h (e jw ) in (28) is not band-limited. Therefore, the 
condition Wd < -g^ can be satisfied only when TB = 1 
which is not a reliable strategy. On the other hand, if the fading 
is varying slowly and the value of a is close to 1, the Doppler 
spectrum Sh(e : ' w ) decreases sharply for large frequencies and 
most of the energy is kept around the lower frequencies. 
We can easily find that the frequency ranges [— 7r/49, 7r/49], 
[— 7r/9,7r/9] and [— 7r/4, 7r/4] contain more than 90% of the 
power when a = 0.99, 0.95 and 0.90, respectively. Therefore, 
in these cases the effect of aliasing is negligible. 

We can easily obtain the error variance for the noncausal 
Wiener filter from (23). On the other hand, we have to perform 
canonical factorization in order to obtain the error variance 
for the causal Wiener filter in the absence of aliasing. Due to 
limited space in the paper, we will skip some of the calculation 



— Noncausal Wiener Filter Estimation 
■ - Causal Wiener Filter Estimation 





SNRdB 



Fig. 1. SNR dB v.s. Data Rate (Nats/Sec) when B = 100Hz, T = 0.5sec, 
N = O.lsec, a = 0.99, P d = 0.91, and P f = 0.23. 



steps. We can express the anti-causal part of (24) as 



S h (e*&) 
L* {ei w ) 



;i - a 2 ) a\v 



C TB 



1 — av 



^1 - we™ j 



where 



and 



P_ 

TB 



c 2 — 4a 2 ( <t 2 + pa 2 



a 2 ) a 2 + (1 + a 2 ) ( CT , 2 



Note that 



L (e> w ) 



1 — ve tb 



(29) 



(30) 



(31) 



(32) 



(33) 



1 — ae tb 

Using (29) and (31), we can obtain the error variance for the 
causal Wiener filter as 



2tt 



\S h (en\ 2 



P t S h (e^)+TB(a 2 +pa 2 p ) 
{l-a 2 ) 2 aiv 2 



TBr e {l - av) 2 (l + v 2 - 2v cos(iu)) 



(hv 



(34) 



Since the primary users are not to be disturbed by the 
secondary users, when the primary users are active there is an 
average transmission power threshold on the secondary users 
which is denoted as I avg . Therefore, the power constraint can 
be expressed as follows 

Pt + [(T - N)B - 1] [P d P! + (1 - P d )P 2 ] < TI avg (35) 

and the average signal-to-noise (SNR) is 



SNR- 



(36) 



In Figure 1, employing a causal Wiener filter and a non- 
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Fig. 2. Data Rate and Probability of False Alarm, Pt v.s. Probability of 

Detection, P d when B = 100Hz, T = 0.5sec, N = O.lsec, and a = 0.99. 



causal Wiener filter at the secondary receiver, we show the 
data rate as a function of SNR (dB) when B = 100Hz, and 
T = 0.5 seconds and N = 0.1 seconds. We choose Pd = 0.91 
and Pf = 0.23 since in reality these are plausible and 
obtainable values considering the tradeoff between probability 
of detection and false alarm. Note that when Pd = 1, there will 
be no constraint on P2 and the secondary user can use power 
as much as its peak value allows. As expected, we obtain 
higher data rates when we use a noncausal Wiener filter. Note 
that the difference between the data rates obtained by using 
a causal and a noncausal Wiener filter is decreasing with the 
increasing SNR. In Fig. 2, we compare the data rate values 
for different SNR values, dB and 10 dB, as a function of 
probability of detection, Pd, again employing a causal and a 
noncausal Wiener filter at the secondary receiver. It is clearly 
seen that the highest data rates are obtained when Pd = 0.91 
and Pj — 0.23 for both Wiener filters. Again, as expected the 
data rates obtained using noncausal Wiener filters are higher 
than the ones obtained by causal Wiener filters. In Fig. 3, 
we display the optimal power distribution as a function of 
probability of detection, Pd. Recall that P t denotes the power 
allocated to the pilot symbol, and Pi and P2 represent the 
power allocated to the data symbols when the channel is busy 
and idle, respectively. In Fig. 3, it is easily observed that 
the power allocated to the data symbols when the channel 
is busy or idle is decreasing with decreasing Pd, since the 
interference power constraint is mainly over P2 and there is 
less transmission with Pi. Furthermore, the power allocated 
to the pilot symbol is the highest when Pd = 0.91, where at 
the same time Pf = 0.23. 

VI. Conclusion 

We have studied channel sensing, pilot-symbol assisted 
channel estimation, and achievable rates when causal and non- 
causal Wiener filters are employed at the secondary receiver. 
We have obtained achievable rates by finding error variances 
in both cases, and shown the effects of channel sensing 
over the achievable rates. Subsequently, we have optimized 
power allocated to the pilot and data symbols. Finally, we 
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Probability of Detection, P d 

Fig. 3. The optimal power distribution among the pilot and data symbols 
when B = 100Hz, T = 0.5sec, N = O.lsec, and a = 0.99. 



have obtained numerical results in Gauss-Markov channels 
showing the optimal parameter values. We have compared 
the performances of causal and noncausal Wiener filters at 
different SNR, P d and Pf values. 
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